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goal  of  this  project  is  to  build  on  our  preliminary  data  that  suggests  that  patients  with  ET  have  distinct  platelet  transcript 
profiles  that  differ  from  those  of  normal  platelets.  The  three  main  hypotheses  to  be  tested  are:  (1)  patients  with  ET  have 
mRNA  profiles  that  are  distinct  from  those  of  normal  controls;  (2)  these  differences  can  be  used  to  elucidate  the  molecular 
basis  of  ET;  and  (3)  these  differences  can  be  used  to  differentiate  ET  from  other  causes  of  thrombocytosis  (ET  diagnostics). 
Completion  of  the  specific  aims  as  outlined  below  should  (/)  provide  considerable  insight  into  the  molecular  basis  of  ET,  (//) 
assist  with  molecular  diagnostics,  and  (///)  help  to  devise  rational  approaches  for  pharmacological  intervention. 
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I.  INTRODUCTION.  Human  blood  platelets  play  critical  roles  in  normal  hemostatic  processes  and  pathologic 
conditions  such  as  thrombosis  (i.e.  cardiovascular  disease  and  stroke),  vascular  remodeling,  inflammation,  and 
wound  repair.  Despite  the  biological  importance  of  platelets  and  their  intact  protein  synthetic  capabilities, 
remarkably  little  is  known  about  platelet  mRNAs.  The  pathogenesis  of  essential  thrombocytosis  (ET),  a  disease 
of  platelet  number  and  function,  is  poorly  understood  at  the  molecular  level.  The  main  goal  of  this  project  is  to 
build  on  our  preliminary  data  that  suggests  that  patients  with  ET  have  distinct  platelet  transcript  profiles  that 
differ  from  those  of  normal  platelets.  The  three  main  hypotheses  to  be  tested  are:  (1)  patients  with  ET  have 
mRNA  profiles  that  are  distinct  from  those  of  normal  controls;  (2)  these  differences  can  be  used  to  elucidate 
the  molecular  basis  of  ET;  and  (3)  these  differences  can  be  used  to  differentiate  ET  from  other  causes  of 
thrombocytosis  (ET  diagnostics).  Completion  of  the  specific  aims  as  outlined  below  should  (/)  provide 
considerable  insight  into  the  molecular  basis  of  ET,  (//)  assist  with  molecular  diagnostics,  and  (///)  help  to  devise 
rational  approaches  for  pharmacological  intervention.  These  three  major  hypotheses  are  the  basis  for  the 
following  specific  aims: 

Specific  Aim  I.  Validate  differences  in  microarray  profiles  between  normal  and  ET  patients 

Specific  Aim  II.  Establish  the  positive  and  negative  predictive  values  of  these  profiles 

Specific  Aim  III.  Develop  a  pilot  ET  diagnostic  microarray  chip  and  evaluate  a  transcriptome-based  approach 
to  the  diagnostics  of  ET. 

II.  BODY. 


A.  APPROACH  (Specific  Aims  1  and  2):  Microarray  analysis  was  used  to  study  the  molecular  basis  of 
essential  thrombocythemia  using  highly-purified  platelets  isolated  from  peripheral  blood  (20  mL)  or  by 
apheresis.  Differences  were  validated  by  quantitative  PCR  (Q-PCR)  and/or  protein  analyses  as  necessary. 

B.  RESULTS:  Initial  one-way  ANOVA  identified  170  genes  that  were  differentially  expressed,  the  majority  of 
which  (141)  were  up-regulated  in  ET  platelets;  only  29  genes  were  down-regulated  in  ET  compared  to  normal 
platelets.  Analysis  of  the  smaller  subset  of  platelet-restricted  genes  demonstrated  that  only  13  genes  were 
differentially-expressed  (12  up-regulated,  1  down-regulated)  in  ET.  A  disproportionate  number  of  upregulated 
genes  encoded  proteases  or  protease  inhibitors  {HPSE,  MMP1,  SERPINI1),  a  class  of  proteins  known  to  be 
associated  with  tumor  invasiveness  and  metastases.  The  single  down-regulated  gene  {HSD17B3)  was 
present  in  all  normal  samples  and  absent  in  all  ET  platelets.  HSD17B3  belongs  to  an  extended  family  of 
l/BHSDs  retaining  oxido-reductase  activity  toward  discrete  substrates,  and  encodes  an  enzyme  (type  3  17  - 
hydroxysteroid  dehydrogenase)  previously  described  as  testis-specific.  This  enzyme  is  known  to  catalyze  the 
penultimate  step  in  testosterone  biosynthesis.  Using  a  functional  assay  of  testosterone  generation,  we 
demonstrated  that  platelets  retained  17pHSD3  activity,  with  nearly  10%  of  the  capacity  found  in  mouse  testis, 
providing  evidence  for  the  first  non-testicular  source  of  this  enzyme.  Transcripts  for  two  additional  members  of 
this  family  were  found  in  human  platelets,  one  of  which  {HSD17B12)  was  upregulated  in  the  initial  cohort  of 
patients  studied.  Subsequent  Q-PCR  results  were  entirely  concordant  for  all  individuals  studied, 
demonstrating  that  17BHSD12:17BHSD3  transcript  ratios  reliably  distinguished  ET  from  normal  patient 
platelets  in  all  samples  studied  to  date  (N=20;  6  apheresis  samples,  14  peripheral  blood  samples;  p  <0.0001). 
Furthermore,  these  differential  patterns  of  HSD17B  expression  appeared  unrelated  to  the  development  of 
thrombocytosis  perse,  but  rather,  were  restricted  to  the  ET  phenotype. 

NB:  This  data  is  summarized  in  the  manuscript  D.  Gnatenko,  et.  al.  (Thromb.  Haemost.,  94:412-21, 2005). 

Changes  to  Specific  Aims  1  and  2:  NONE 
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C.  ONGOING  STUDIES  (Specific  Aim  3):  A  diagnostic  platelet  oligonucleotide  microarray  chip 


Human  blood  platelets  play  critical  roles  in  normal  hemostatic  processes  and  pathologic  conditions  such  as 
thrombosis  (heart  attacks  and  stroke),  vascular  remodeling,  inflammation,  and  wound  repair.  We  have 
identified  a  restricted  list  of  platelet-expressed  genes,  and  used  these  for  fabrication  of  a  focused,  intermediate 
density  oligonucleotide  gene  chip.  For  probe  selection,  we  analyzed  microarray  data  isolated  from  (/)  normal 
patient  peripheral  blood  leukocytes  (WBC),  (//)  patients  with  essential  thrombocythemia  ([ET];  i.e.  those  with 
high  platelet  counts),  and  (///)  normal  platelets  (NP).  We  hypothesized  that  such  a  strategy  would  maximize  the 
identification  of  informative  genes  for  chip  fabrication.  The  final  list  was  established  by  including  the  following 
cohorts:  (/)  a  group  of  platelet-restricted  genes  with  no  expression  in  leukocytes  {N=126y,  a  preliminary  group 
of  discriminatory  genes  distinguishing  between  thrombohemorrhagic  ET  phenotypes  {N=71)\  the  list  of  genes 
with  platelet  expression  >  leukocyte  expression  by  10-fold  {N=285y,  and  the  list  of  genes  with  leukocyte 
expression  >  platelet  expression  by  10-fold  {N=43)  [leukocyte  contamination  control].  After  removal  of 
duplicates,  the  final  list  contains  432  genes  which  clearly  co-segregate  by  cell-type  (Fig.  1). 

Fig.  1.  Gene  rank-intensity  piots.  Normalized  data  from 
individual  samples  (WBC=3,  NP=5;  ET=6)  were  analyzed  by 
one-way  ANOVA  using  parametric  testing  to  identify  a  432- 
member  gene  list.  The  top  panei  demonstrates  the  platelet- 
restricted  genes  (N=389)  while  the  iower  panei  demonstrates 
the  leukocyte-restricted  genes  (N=43).  Note  the  clear  difference 
in  the  expression  patterns  between  the  two  groups,  and  the 
variability  in  expression  among  the  various  platelet  populations 
(reflecting  to  some  degree  the  different  platelet  populations  (ET 
versus  normal). 

Since  platelets  have  limited  mRNA,  we  have  also  developed  and 
refined  methods  for  reproducible  mRNA  amplification,  as  an 
adjunct  for  chip  development.  Starting  with  20  ng  of  RNA 
(readily  obtainable  from  10  ml  of  whole  blood),  we  have 
generated  sufficient  quantities  of  RNA  that  maintain  excellent 
amplification  fidelity  for  transcript  profiling.  After  hybridization  to  our  fabricated  chip,  we  established  correlation 
coefficients  for  the  group  of  platelet  genes  (N=319)  that  were  expressed  in  unamplified  vs.  amplified  microarray 

experiments.  These  data  were  generated  by  calculating  ratios 
against  the  co-hybridized  reference  RNA,  and  subsequently 
plotted  (Fig.  2).  For  this  subset  of  evaluable  genes,  the  R- 
value  was  0.67  with  high  statistical  significance  (p<0.0001). 

We  propose  that  this  chip  can  be  developed  commercially  for 
platelet  diagnostics  (anticipated  1  million  tests  annually  in  the 
US),  and  for  establishment  of  microarray  profiles  for 
thrombosis-risk  susceptibility  in  common  diseases  such  as 
stroke  and  heart  disease;  additional  applications  include 
potential  identification  of  novel  genes  as  drug  targets  for 
modulating  platelet  function  (gene  and  drug  discovery). 
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Changes  in  Specific  Aim  3:  None.  In  the  next  funding  period  we  will  continue  our  validation  studies  of  the 
chip  in  distinguishing  ET  from  other  causes  of  (reactive)  thrombocytosis 

III.  KEY  RESEARCH  ACCOMPLISHMENTS. 


•  First  demonstration  that  platelet  microarray  studies  can  be  applied  to  a  human  platelet  disorder 

•  First  demonstration  that  ET  platelets  have  a  profile  distinct  from  normal  platelets 

•  First  demonstration  that  a  diagnostic  assay  may  be  available  to  distinguish  ET  from  normal  platelets 
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•  Initial  fabrication  and  characterization  of  a  platelet-diagnostic  oligonucleotide  chip 

IV.  REPORTABLE  OUTCOMES. 

A.  Manuscripts  accepted  and/or  published 

•  Gnatenko,  D.,  L.  Cupit,  E.  Huang,  A.  Dhundale,  P.  Perrotta,  and  W.  Bahou.  Platelets  express  steroidogenic 
17p-hydroxysteroid  dehydrogenases:  distinct  profiles  predict  the  essential  thrombocythemic  phenotype, 
Thromb.  Haemost.,  94:412-21, 2005. 

•  Watson,  S.,  W.  Bahou,  D.  Fitzgerald,  W.  Ouwehand,  A.  K.  Rao,  and  A.  Leavitt.  Mapping  the  platelet 
proteome:  a  report  of  the  ISTH  platelet  physiology  subcommittee.  J.  Thr.  Haemost.  3:2098-2101, 2005. 

•  Gnatenko,  D.  and  W.  Bahou.  Advances  in  platelet  transcriptomics.  Trans.  Med.  Hemotherapv,  in  press. 

•  Bahou,  W.  Platelet  genetic  disorders,  in  Genomics  and  Clinical  Medicine.  D.  Kumar,  editor,  Oxford 
University  Press,  2006. 


B.  Invited  Reviews  (Pending  publication) 

Gnatenko,  D.,  P.  Perrotta,  and  W.  Bahou,  Advances  in  platelet  proteomics:  one-half  the  story.  Blood. 
submitted 

C.  Oral  presentations 

•  “Global  platelet  profiling”,  Platelets  Symposium  (Jerusalem,  Israel;  May,  2006) 

•  “Transcript  profiling  of  essential  thrombocythemic  platelets",  Dept.  Defense  Peer  Reviewed  Medical 
Research  Program  (San  Juan,  Puerto  Rico;  May,  2006) 

•  “Progress  and  limitations  of  platelet  transcript  profiling”.  International  Society  of  Thrombosis  and 
Hemostasis  (Sydney,  Australia;  2005) 


V.  CONCLUSION:  We  have  made  considerable  progress  in  all  aims  of  the  research,  specifically  in  (/) 
providing  initial  proof-of-principal  that  ET  has  unique  mRNA  transcript  profiles,  (//)  that  these  profiles  can  be 
used  to  develop  potential  diagnostic  tests  for  the  disease,  and  {Hi)  in  identifying  a  novel  steroidogenic  pathway 
in  normal  and  platelets.  During  the  subsequent  funding  cycle,  we  will  continue  validation  of  these  initial 
observations  by  recruiting  a  broader  cohort  of  subjects,  and  initiating  preliminary  studies  to  dissect  the  role  of 
steroidogenic  pathways  in  megakaryocyte  proliferation  and/or  proplatelet  formation  (thrombopoiesis). 

VI.  REFERENCES.  SEE  SECTION  IV.  Reportable  Outcomes  {above) 

VI.  APPENDICES. 


•  Gnatenko,  D.,  L.  Cupit,  E.  Huang,  A.  Dhundale,  P.  Perrotta,  and  W.  Bahou.  Platelets  express  steroidogenic 
17p-hydroxysteroid  dehydrogenases:  distinct  profiles  predict  the  essential  thrombocythemic  phenotype, 
Thromb.  Haemost.,  94:412-21, 2005. 

•  Watson,  S.,  W.  Bahou,  D.  Fitzgerald,  W.  Ouwehand,  A.  K.  Rao,  and  A.  Leavitt.  Mapping  the  platelet 
proteome:  a  report  of  the  ISTH  platelet  physiology  subcommittee.  J.  Thr.  Haemost.  3:2098-2101, 2005. 


VII.  SUPPORTING  DATA.  As  outlined  in  Figures  1  and  2  {above)  and  in  two  appended  articles. 
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Summary 

Human  blood  platelets  have  important,  regulatory  functions  in 
diverse  hemostatic  and  pathological  disorders,  including  vascu¬ 
lar  remodeling,  inflammation,  and  wound  repair.  Microarray 
analysis  was  used  to  study  the  molecular  basis  of  essential 
thrombocythemia,  a  myeloproliferative  disorder  with  quanti¬ 
tative  and  qualitative  platelet  defects  associated  with  cardiov¬ 
ascular  and  thrombohemorrhagic  symptoms,  not  infrequently 
neurological.  A  platelet-expressed  gene  (HSDI7B3)  encoding 
type  3  l7(3-hydroxysteroid  dehydrogenase  (previously  char¬ 
acterized  as  a  testis-specific  enzyme  catalyzing  the  final  step  in 
gonadal  synthesis  of  testosterone)  was  selectively  down-regu¬ 
lated  in  ET  platelets,  with  reciprocal  induction  of  the  type  1 2 
enzyme  (HSDI7BI2).  Functional  I7[3-HSD3  activity  cor- 

Keywords 

Platelets,  hydroxysteroid  dehydrogenases,  microarray,  essential 
thrombocythemia 


responding  to  ~I0%  of  that  found  in  murine  testis  was  demon¬ 
strated  in  normal  platelets. The  induction  of  HSDI7BI2  in  ET 
platelets  was  unassociated  with  a  concomitant  increase  in  an¬ 
drogen  biosynthesis,  suggesting  distinct  functions  and/or  sub¬ 
strate  specificities  of  the  types  3  and  12  enzymes.  Application  of 
a  molecular  assay  distinguished  ET  from  normal  platelets  in  20 
consecutive  patients  (p  <0.0001).  These  data  provide  the  first 
evidence  that  distinct  subtypes  of  steroidogenic  1 7[3-HSDs  are 
functionally  present  in  human  blood  platelets,  and  that  the  ex¬ 
pression  patterns  of  HSDI7B3  and  HSDI7BI2  are  associated 
with  an  uncommon  platelet  disorder  manifest  by  quantitative 
and  qualitative  platelet  defects. 


Thromb  Haemost  2005;  94:  412-21 


Introduction 

Circulating  blood  platelets  are  anucleate  although  they  retain 
small  amounts  of  megakaryocyte-derived  mRNAs  and  a  fully 
functional  protein  biosynthetic  capacity  (1).  Essential  thrombo¬ 
cythemia  (ET)  represents  a  myeloproliferative  disorder  subtype, 
characterized  by  increased  proliferation  of  megakaryocytes, 
elevated  numbers  of  circulating  platelets,  and  considerable 
thrombohemorrhagic  events,  not  infrequently  neurological  (2). 
ET  is  seen  with  equal  frequency  in  males  and  females,  although 
an  additional  female  incidence  peak  at  age  30  may  explain  the 
apparent  higher  disease  prevalence  in  females.  The  molecular 
basis  of  ET  remains  unestablished  although  historically  it  has 


been  considered  a  “clonal”  disorder  (3).  Causative  mutations 
have  been  identified  in  the  thrombopoietin  gene,  but  these  ap¬ 
pear  to  be  uncommon,  and  restricted  to  rare  individuals  with 
familial  thrombocythemia  (4).  Other  than  the  exaggerated  pla¬ 
telet  volume  and  inconsistent  platelet  aggregation  abnormalities 
evident  in  subsets  of  ET  platelets,  no  functional  or  diagnostic  test 
is  currently  available  for  ET  and  it  remains  a  diagnosis  of  exclu¬ 
sion  (5). 

The  utility  of  gene  expression  profiling  for  the  molecular 
classification  of  human  cancer  is  well-documented,  although  its 
applicability  to  poorly-understood  myeloproliferative  disorders 
such  as  essential  thrombocythemia  remains  unestablished.  Pre¬ 
vious  work  from  this  laboratory  demonstrated  the  feasibility  of 
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platelet  profiling  using  apheresis  techniques  (1,6),  prompting  us 
to  adapt  this  method  for  the  molecular  study  of  ET.  We  now  dem¬ 
onstrate  that  human  blood  platelets  express  a  previously  de¬ 
scribed  (testis-specific)  enzyme  (type  3  17[3-hydroxysteroid 
dehydrogenase)  known  to  catalyze  the  penultimate  step  in  testos¬ 
terone  interconversion;  furthermore,  distinct  expression  patterns 
of  genes  encoding  this  enzyme  are  associated  with  the  ET  pheno¬ 
type.  These  data  identify  establish  a  novel  link  between  platelet 
function  and  hormone  steroidogenic  pathways  that  may  be 
causally  implicated  in  platelet-mediated  thrombotic  events,  and/ 
or  platelet  production. 

Methods 

Patient  selection  and  characterization 

Patients  were  enrolled  from  the  larger  pool  of  patients  referred  to 
the  Division  of  Hematology  for  evaluation  of  thrombocytosis. 
All  patients  provided  informed  consent  for  an  IRB  (Institutional 
Review  Board)-approved  protocol  completed  in  conjunction 
with  the  Stony  Brook  University  Hospital  General  Clinical  Re¬ 
search  Center.  Standard  hematological  criteria  were  followed  for 
the  diagnosis  of  essential  thrombocythemia,  reactive  throm¬ 
bocytosis,  or  other  myeloproliferative  disorders  (7,  8).  Both  sex- 
and  age-distribution  paralleled  prevalence  figures  for  ET,  with  a 
M:F  ratio  of  1:2.3,  and  age  at  diagnosis  ranging  23-78  years.  Pla¬ 
telet  counts  at  the  time  of  blood  isolation  ranged  from  normal 
(reflecting  treatment)  to  l,724,000/|il;  patient  utilization  of  pla¬ 
telet-lowering  drugs  (i.e.  hydroxyurea,  analgrelide,  or  untreated) 
was  recorded  at  the  time  of  platelet  isolation  and  purification 
(refer  to  Table  1  for  detailed  patient  characteristics). 

Platelet  molecular  studies 

Platelets  were  obtained  by  apheresis  or  from  peripheral  blood 
(10  ml),  and  were  isolated  essentially  as  previously  described, 
utilizing  gel-filtration  and  CD45-coupled  magnetic  micro-beads 
for  leukocyte  immunodepletion  (1).  The  final  platelet-enriched 
product  contained  no  more  than  3-5  leukocytes  per  1X10^  pla¬ 
telets;  peripheral  blood  leukocytes  from  three  healthy  donors 
were  isolated  as  previously  described  (1).  Pure  cellular  pellets 
were  resuspended  in  10  ml  of  Trizol  reagent  (Invitrogen,  Carls¬ 
bad,  CA),  transferred  into  DEPC  (diethylpyrocarbonate)-treated 
Corex  tubes,  and  serially  purified  and  precipitated  using  isopro¬ 
panol  (9).  Platelet  total  mRNA  quantification  and  integrity  were 
established  using  an  Agilent  2100  Bioanalyzer,  and  mean  pla¬ 
telet  RNA  concentrations  between  the  two  groups  were  nearly 
identical:  ET  platelets  contained  ~0.8  ±  0.2  fg/platelet  while  nor¬ 
mal  platelet  RNA  concentrations  were  ~0.6  ±  0.3  fg/platelet. 
Quantitative  reverse  transcription  (RT)-PCR  was  performed 
using  fluorescence-based  real-time  PCR  technology  (TaqMan 
Real-Time  PCR,  Applied  Biosystems,  Foster  City,  CA).  Oligo¬ 
nucleotide  primer  pairs  were  generated  using  Primer3  software 
(www-genome.wi.mit.edu),  designed  to  generate  ~200-bp  PCR 
products.  i/SD7 7Ri-specific  primers  were:  forward  (5’-3’ 
AAATGTGATAACCAAGACTGC  [bp  755-775];  reverse  (5’-3’ 
CTTGGTGTTGAGCTTCAGGTA  [bp  956-936];  HSD17B12- 
specific  primers  were:  forward  (5 ’-3’  TGAATACTTTTGGAT- 
GTTCCTGA  [bp  496-519]);  reverse  (5’-3’ AGTCTTGGTTG- 
CAGAATAGATGGT,  [bp  634-611];  7/5D77S7  7 -specific 


primers  were:  forward  (5’-3’  TGGATATAAAATGAAAGCG 
CAATA  [bp  1067-1090];  reverse  (5’-3’:  ATCAGCTTTTG 
GCTAAAGAACAAG  [bp  1265-1242];  F7-specific  primers 
were:  forward  (5 ’-3’:  TCCTGTTGTTGGTGAATGG  [bp  734- 
753];  reverse  (5’-3’:  GTACGTGCTGGGGATGATG  [bp933- 
915];  P-actin-specific  primers  were  as  previously  described  (1). 
Purified  platelet  mRNA  (4  |ig)  was  used  for  first  strand  cDNA 
synthesis  using  random  hexamers  and  Superscript  II  reverse 
transcriptase  (Invitrogen,  Carlsbad,  CA).  For  RT-PCR  analysis, 
the  RT  reaction  was  equally  divided  among  primer  pairs  and 
used  in  a  40-cycle  PCR  reaction  for  each  target  gene  using  three- 
step  cycles  of  94°C  for  30  seconds,  55°C  for  30  seconds,  and 
72°C  for  1  min;  mRNA  levels  were  quantified  by  monitoring 
real-time  fluorimetric  intensity  of  SYBR  green  I.  Relative 
mRNA  abundance  was  determined  from  triplicate  assays  per¬ 
formed  in  parallel  for  each  primer  pair,  calculated  and  standard¬ 
ized  to  P-actin  as  previously  described  (6, 1 0).  For  some  patients, 
high  molecular-weight  genomic  DNA  was  isolated  from  periph¬ 
eral  blood  leukocytes  for  PCR-based  amplification  and  sequenc¬ 
ing  of  exons  and  exon-intron  boundaries  (11). 

Gene  expression  profiles  were  completed  using  the  HU  1 3  3  A 
GeneChip  containing  a  22,  283  probe  set  (Affymetrix,  Santa 
Clara,  CA).  Total  cellular  RNA  (5.8  |ig)  was  used  for  cDNA  syn- 


Table  I :  Patient  characteristics. 


ID 

Age' 

Sex 

Diag- 

nosis 

Sample 

Source^ 

Platelet 

count^ 

(X  lO’/L) 

Treat¬ 

ment'’ 

Genetic 

analysis^ 

ETI 

31 

F 

ET 

B,P 

1,308 

H 

M,P 

ET3 

49 

M 

ET 

B,P 

565 

H 

M,P 

ET4 

33 

M 

ET 

B,P 

1,515 

N 

M,P 

ET5 

23 

F 

ET 

B,P 

1,566 

N 

M,P 

ET6 

37 

F 

ET 

B,P 

539 

A 

M,P 

ET7 

65 

F 

ET 

B 

991 

A 

P 

ET9 

64 

F 

ET 

B 

588 

A 

P 

ETIO 

57 

M 

PV/ET6 

B 

945 

H 

P 

ETII 

71 

F 

ET 

B 

477 

H 

P 

ETI2 

40 

F 

ET 

B 

456 

N 

P 

ETI4 

65 

F 

ET 

B 

853 

A 

P 

ETI5 

64 

F 

ET 

B 

511 

H 

P 

ETI8 

78 

M 

ET 

B 

329  (940) 

H 

P 

ETI9 

77 

F 

ET 

B 

488 

H 

P 

ET20 

69 

M 

ET 

B 

345  (1,063) 

A 

P 

ET27 

50 

F 

PV/ET6 

B,P 

1,724 

H 

M,P 

ET28 

29 

F 

RT7 

B 

402 

N 

P 

ET29 

66 

F 

ET 

B 

495 

N 

P 

ET3I 

70 

M 

ET 

B 

347  (880) 

A 

P 

ET32 

50 

F 

ET 

B 

513 

N 

P 

ET33 

75 

F 

ET 

B 

641 

H 

P 

'Age  at  diagnosis 

^Sample  source:  B  -  peripheral  blood;  P  -  plateletpheresis 

^Platelet  count  at  time  of  sample  collection  (normal  range  1 50  -  350);  note  that  for  patients  with 
normal  platelet  counts  at  time  of  blood  isolation,  the  platelet  counts  in  parentheses  are  highest 
pre-treatment  determinations 

^Refers  to  treatment  at  the  time  of  platelet  isolation;  A  -Anagrelide;  H-  Hydroxyurea;  N  -  Not 
treated  5M  -  Microarray;  P  -  Quantitative  RT-PCR 
^Originally  given  the  diagnosis  of  polycythemia  rubra  vera  (PV) 

^Secondary  (post-splenectomy)  thrombocytosis 
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thesis  using  Superscript  Choice  system  (Life  Technologies, 
Rockville,  MD)  and  an  oligo  (dT)  primer  containing  the  T7 
polymerase  recognition  sequence,  followed  by  cDNA  purifi¬ 
cation  using  phenol/chloroform  extraction  and  ethanol  precipi¬ 
tation  (!)./«  vitro  transcription  was  completed  in  the  presence  of 
biotinylated  ribonucleotides  using  a  BioArray  HighYield  RNA 
Transcript  Labeling  Kit  (Enzo  Diagnostics,  Farmingdale,  NY), 
and  after  metal-induced  fragmentation,  10  |ig  of  the  biotinylated 
cRNA  was  hybridized  to  the  GeneChip  array  for  16  hours  at 
45°C.  After  washing,  the  cRNA  was  detected  with  streptavidin- 
phycoerythrin  (Molecular  Probes,  Eugene,  OR)  and  analysis 
completed  using  a  Hewlett-Packard  Gene  Array  Scanner.  The 
fluorescence  intensity  of  each  probe  was  quantified  using  Af- 
fymetrix  GeneChip  software  (MAS  version  5.0),  calculated  as  an 
average  difference  for  each  gene  set  obtained  from  16  to  20 
paired  (perfectly  matched  and  single  nucleotide-mismatched) 
25-bp  oligonucleotides.  The  software  is  designed  to  exclude 
“positive  calls”  in  the  presence  of  high  average  differences  with 
associated  high  mismatch  intensities. 

Bioinformatic  and  statistical  analyses 

Microarray  data  were  analyzed  and  visualized  using  GeneSpring 
(version  7.0)  software  (Silicon  Genetics,  a  subsidiary  of  Agilent 
Technologies,  Palo  Alto,  CA).  Megakaryocyte  expression  data 
were  obtained  from  the  National  Center  for  Biotechnology 
(NCBI)  Gene  Expression  Omnibus  (GEO)  database  (http:// 
www.ncbi.nlm.nih.gov/projects/geo/;  platform  GPL96),  using 
profiles  obtained  from  thrombopoietin-stimulated  normal  and 
ET  bone  marrow  CD34^  cells  differentiated  in  liquid  culture  for 
14  to  16  days  (12).  CD34+-derived  megakaryocytes  (Mks)  were 
purified  using  a  megakaryocyte-specific  anti-CD4 1  a  monoclon¬ 
al  antibody,  followed  by  microarray  analysis  using  the  identical 
HU  133 A  GeneChip.  Data  were  normalized  by  dividing  each 
measurement  by  the  50th  percentile  of  all  measurements  in  that 
sample,  and  each  gene  was  divided  by  the  median  of  its  measure¬ 
ments  in  all  samples.  Normalized  median  ratios  of  individual 
genes  were  filtered  for  presence  across  arrays,  and  selected  for 
expression  levels  as  detailed.  Prior  to  unsupervised  hierarchical 
clustering  of  the  uncentered  Pearson  correlation  similarity  ma¬ 
trix,  the  platelet  microarray  data  were  filtered  for  gene  ex¬ 
pression  across  phenotypic  cohorts,  defined  as  those  genes  pres¬ 
ent  or  marginal  in  a  minimum  of  80%  of  platelet  samples  (yield¬ 
ing  2,906  transcripts).  A  subset  of  genes  was  culled  from  the 
2,906-gene  list  to  specifically  delineate  those  transcripts 
uniquely  expressed  in  platelets;  this  platelet-restricted  subset 
(N=  1 26)  was  delineated  by  removing  genes  expressed  in  3/3  leu¬ 
kocyte  microarrays.  A  non-parametric  analysis  of  variance  test 
(ANOVA)  was  performed  to  identify  differentially  expressed 
genes  using  the  Benjamini  and  Hochberg  method  to  lower  the 
false  discovery  rate  (p  <  0.01).  All  statistical  analyses  were  com¬ 
pleted  using  SPSS  (Statistical  Package  for  Social  Sciences,  ver¬ 
sion  1 1.5)  software. 

Functional  1 7p-HSD  studies 

Functional  studies  for  platelet  17[3-HSD3  activity  were  com¬ 
pleted  using  gel-filtered  platelets  (GFP).  Briefly,  1.5  X  10^  pla¬ 
telets  were  solubilized  and  freeze-thawed  in  HSD  buffer  contain¬ 
ing  {20  mM  Tris-HCl  (pH  8.0),  5  mM  MgClz,  50  mM  NaCl,  5% 


Glycerol,  10  mM  DTT,  1.5  mM  NAD  (D-5755,  Sigma  Co.,  St. 
Louis,  MO)},  and  the  reaction  started  by  addition  of  1  |il  (12.7 
pmol)  of  [l,2,6,7-^H(N)]-testosterone  (specific  activity  78.5  Ci/ 
mM)  (Dupont/NEN)  to  equivalent  protein  aliquots.  A  30-  or 
90-minute  reaction  was  allowed  to  proceed  at  25°  C,  and 
quenched  at  -20°C  in  the  presence  of  cold  testosterone  (1  mg/ 
ml)  and  androstenedione  (1  mg/ml).  Steroid  extraction  was  per¬ 
formed  twice  using  150  |il  of  ethyl  acetate,  both  fractions  were 
combined  and  air-dried  under  a  vacuum.  Androstenedione  and 
testosterone  were  separated  by  thin  layer  chromatography  (Un¬ 
iplate  Silica  Gel  GF  plates  (250  pm),  Analtech  Inc,  Newark,  DE) 
using  4:1  (v/v)  chloroform:ethyl  acetate.  Discrete  steroid  frac¬ 
tions  were  visualized  and  extracted  from  silica  gels  using  etha¬ 
nol,  and  quantified  by  liquid  scintillography.  Purified  hydroxys- 
teroid  dehydrogenase  from  Pseudomonas  testosteroni  (Sigma 
Co.,  St.  Louis,  MO)  diluted  to  10  mg/ml  in  HSD  buffer  served  as 
control  for  some  experiments.  Mouse  testis  extract  prepared 
from  a  C57/B16  mouse  served  as  standard  for  platelet  1 7[3-HSD3 
quantification,  and  was  prepared  by  homogenization  in  3  ml  of 
HSD  buffer.  After  centrifugation  at  5000  g  for  5  minutes,  super¬ 
natants  were  gel-filtered  in  HSD-equilibrated  Centrisep  spin 
columns  prior  to  use;  protein  quantification  of  all  samples  was 
determined  by  optical  density  as  previously  described  (13). 

Results 

The  genetic  profiles  of  highly-purified  apheresis  platelets  iso¬ 
lated  from  6  ET  patients  (4  females,  2  males)  and  5  normal, 
healthy  controls  demonstrated  distinctly  different  molecular  sig¬ 
natures  (Fig.  1).  ET  platelets  collectively  demonstrated  higher 
numbers  of  expressed  transcripts  compared  to  normal  controls, 
but  considerably  less  than  the  transcript  numbers  generally 
found  in  nucleated  cells  (1).  Of  the  genes  classified  as  marginal 
or  present  in  a  minimum  of  4  microarrays,  ET  patient  samples 
expressed  an  average  of  3,562  transcripts  compared  to  1,668  for 
normal  controls  (compared  with  ~10,500  transcripts  identified 
in  all  three  leukocyte  microarrays).  More  stringent  analyses  (i.e. 
marginal  or  present  in  all  of  the  arrays  within  a  single  group)  ex¬ 
tended  these  differences,  with  1,840  transcripts  expressed  in  ET 
platelets  versus  1,086  transcripts  expressed  in  platelets  from 
healthy  controls  (p  <0.03).  Thus,  while  bone  marrow  mega¬ 
karyocyte  expansion  is  known  to  accompany  the  thrombocy- 
themic  phenotype  (2),  this  cellular  proliferation  is  also  associ¬ 
ated  with  a  nearly  45%  increase  in  overall  gene  transcription. 

An  unsupervised,  hierarchical  clustering  algorithm  was  used 
to  group  normal  and  ET  platelet  genes  on  the  basis  of  similarities 
of  gene  expression  (Fig.  la,b).  This  direct  comparison  against  a 
genetically  normal  platelet  pool  highlighted  genes  that  consist¬ 
ently  distinguish  normal  from  diseased  platelets  (14).  Normal  or 
ET  platelets  have  genetic  profiles  distinct  from  leukocytes,  and 
notable  differences  between  normal  and  ET  platelets  are  clearly 
evident.  To  gain  further  insight  into  the  specific  changes  associ¬ 
ated  with  the  ET  phenotype  -  and  to  identify  genes  that  could 
discriminate  between  morphologically  indistinguishable  ET  and 
normal  platelets  -  several  methods  of  computational  analyses 
were  applied.  Initial  one-way  ANOVA  identified  170  genes  that 
were  differentially  expressed,  the  majority  of  which  (141)  were 
up-regulated  in  ET  platelets;  only  29  genes  were  down-regulated 
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in  ET  compared  to  normal  platelets  {see  below).  Functional 
cluster  analysis  of  this  limited  set  of  differentially-expressed 
genes  (Fig.  2)  demonstrated  that  genes  involved  in  adhesion  and 
catalytic  activity  represented  the  largest  subgroups,  although  a 
sizable  number  of  genes  (35%)  remained  unclassified. 


Because  of  inherent  difficulties  in  analysis  of  microarray 
data  sets  (15),  we  re-analyzed  the  data,  now  computing  t-stat- 
istics  of  ET  versus  normal  platelets  for  each  gene;  the  t-values 
were  ranked  by  absolute  magnitude  (thereby  incorporating  inter¬ 
sample  variability  in  expression  ratios);  they  were  then  ranked  by 


a 


0  <■ 


Relative  transcript  expression 


Figure  I:  Molecular  signature  of  normal  and  essential  throm- 
bocythemic  platelets.  Gene  expression  profiles  from  14  experimental 
samples  (eleven  apheresis  donors  [5  normal,  NLA-E;  6  patients  with  es¬ 
sential  thrombocythemia  (ET)],  or  3  normal  leukocyte  [WBCA-C]  do¬ 
nors)  are  displayed,  a.  Relationships  between  the  experimental  samples 


are  displayed  as  dendrograms,  in  which  the  pattern  and  length  of  the 
branches  depict  sample  cohort  relatedness  among  the  experimental 
groups,  b.  Unsupervised  hierarchical  clustering  using  the  2,906-gene  set 
demonstrates  the  distinct  variation  in  gene  expression  pattern  among 
defined  cohorts;  each  row  represents  a  single  gene. 
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the  magnitude  of  the  test  statistic  numerator,  a  measure  of  the 
biological  difference  in  expression  ratios.  By  applying  a  rigid 
5-fold  difference  in  pair-wise  expression  as  the  cut-off,  we  ident¬ 
ified  163  genes  that  were  up-regulated  in  ET,  but  only  a  small 
number  (5)  of  down-regulated  genes  (Fig.  3a).  To  further  pare 
this  list,  the  analyses  were  repeated  using  the  database  of  genes 
whose  expression  was  restricted  to  platelets.  This  subset  of  pla¬ 
telet-restricted  genes  was  delineated  by  excluding  genes  from 
the  2,906-gene  list  that  were  expressed  in  all  3  leukocyte  arrays, 
leaving  only  13  genes  that  were  differentially-expressed  (12  up- 
regulated,  1  down-regulated  in  ET)(Fig.  3b).  Of  the  small  subset 
of  platelet-restricted,  differentially  expressed  genes  identified  by 
this  analysis,  7  were  also  in  the  top  40  list  identified  by  the  one¬ 
way  ANOVA,  establishing  an  independent  layer  of  validation  to 
these  findings  (Fig.  3c);  while  only  17%  of  the  170  differen¬ 
tially-expressed  genes  were  under-expressed  in  ET  platelets  by 
one-way  ANOVA,  down-regulated  genes  were  over-represented 
in  the  platelet-restricted  gene  list,  accounting  for  36%  (9/25)  of 
the  total.  Of  the  top  20  over-expressed  genes,  3  encoded  pro¬ 
teases  or  protease  inhibitors  (HPSE,  MMPl,  SERPINIl),  a  class 
of  proteins  well-associated  in  tumor  invasiveness  and  cancer 
metastases  (16).  Interestingly,  matrix  metalloproteinases  have 
recently  been  shown  to  mediate  megakaryocyte  transendothelial 
migration  and  proplatelet  formation,  although  this  effect  appear¬ 
ed  restricted  to  MMP-9  to  the  exclusion  of  MMP-2  (17);  while 
the  latter  report  did  not  specifically  study  MMP-1,  our  data 
would  implicate  MMP-1  as  having  a  specific  or  ancillary  role  in 
the  basement  degradative  process  known  to  accompany  propla¬ 
telet  formation. 

A  single  platelet-restricted  gene  HSD17B3  (encoding  the 
type  3  17[3-hyrdroxysteroid  dehydrogenase  [17[3-HSD3])  was 
expressed  in  all  normal  platelet  arrays,  and  uniquely  under-ex- 
pressed  in  ET  compared  to  normal  platelets  (Fig.  3b).  The  large 
family  of  steroid  dehydrogenases  retain  oxidoreductase  activity 
in  a  wide  range  of  biological  processes,  although  the  17[3-HSDs 
uniquely  function  in  the  formation  and  inactivation  of  all  active 
androgens  and  estrogens,  with  substrate-product  interconversion 
regulated  by  the  oxidative  state  of  the  NADP/NAD(P)Ft  cofac¬ 
tors  (18).  The  HSD  type  3  enzyme  is  generally  regarded  as  testes- 
specific,  although  rare  SAGE  (19)  tags  have  been  identified  in 
CGAP  tissues  including  brain  (7  X  10“^),  skeletal  muscle  (9.3  X 
10“^),  and  prostate  (1.6  X  10“^)  (20).  The  17(3-HSD3  enzyme 
specifically  mediates  the  catalytic  interconversion  involving 
4-androstenedione  and  testosterone,  and  molecular  defects  of 
the  P[SD1 7B3  gene  are  causally  implicated  in  male  pseudoher¬ 
maphroditism  (21).  While  steroidogenic  pathways  are  incom¬ 
pletely  characterized  in  platelets,  previous  data  have  demon¬ 
strated  that  megakaryocytes  (Mk)  express  the  glucocorticoid  re¬ 
ceptor,  and  that  both  Mk  and  platelets  selectively  express  es¬ 
trogen  receptor  [3  (ER[3)  and  androgen  receptor  mRNA  and  pro¬ 
tein,  to  the  exclusion  of  ER  a  or  progesterone  receptor  (22).  Fur¬ 
thermore,  Mk  express  functional  3[3-FlSD  that  is  known  to  cata¬ 
lyze  the  essential  step  in  the  transformation  of  5-pregnen-3[3-ol 
and  5-androsten-3P-ol  steroids  into  the  corresponding 
A‘*-3-keto-steroids,  i.e.  progesterone  as  well  as  the  precursors  of 
all  androgens,  estrogens,  glucocorticoids  and  mineralocorti- 
coids.  Indeed,  Mk-derived  estradiol  triggers  megakaryocyte  pro¬ 
platelet  formation  in  vitro,  a  process  that  is  blocked  through  in¬ 
hibition  of  3[3-FISD  activity  (23). 


Figure  2:  Functional  clusters  of  differentially  expressed  genes 
from  14  experimental  samples  (see  legend  to  Fig.  I).  Gene  Ontol¬ 
ogy  classifications  are  delineated  for  the  differentially-expressed  gene  set 
(N=I70)  as  identified  by  one-way  ANOVA  (p<0.0l). 


To  date,  genes  eneoding  twelve  types  of  17[3-FISD  enzymes 
have  been  described,  although  the  type  6  and  9  genes  have  been 
only  characterized  in  rodents  (24).  Oligonucleotide  probes  spe¬ 
cific  for  all  ten  HSD  1 7B  genes  are  represented  onthe  Affymetrix 
FIU133a  gene  ehip;  probes  for  type  6  and  9  are  not  represented. 
Examination  of  the  microarray  data  demonstrated  that  platelet 
HSD17B  transcript  expression  was  limited  to  three  isoforms: 
HSD17B3,  HSD17B11,  and  HSD17B12.  While  the  HSD17B11 
mean,  normalized  signals  between  the  normal  and  diseased  co¬ 
horts  were  low-level  and  not  statistically  different,  there  was  a 
striking  change  in  the  pattern  of  HSDl  7B3  and  HSDI 7B12  ex¬ 
pression  between  ET  and  normal  platelets  (Fig.  4  a-c).  Absence 
of  HSDl  7B3  transcript  expression  was  evident  in  all  6  ET  pa¬ 
tients,  changes  that  occurred  concomitantly  with  elevated  tran¬ 
script  levels  of  HSD17B12  in  the  same  patient  subgroup.  In 
contrast,  expression  of  HSDl  7B3  in  normal  platelets  was  accom¬ 
panied  by  negligible  to  low-level  HSD17B12  expression.  To 
compare  these  platelet  profiles  with  those  from  ET  and  normal 
megakaryocytes,  we  downloaded  Mk  profiles  from  the  NCBI 
GEO  database  for  bioinformatic  analyses  using  these  published 
data  sets  (12).  ET  Mks  demonstrated  the  identical  patterns  as 
those  found  in  ET  platelets,  i.e.  absence  of  HSDl  7B3  expression 
with  enhanced  expression  of  the  HSDl  7B 12  gene.  Somewhat 
unexpectedly,  these  same  patterns  were  also  evident  in  normal 
Mks;  while  these  comparisons  are  of  potential  relevanee,  they 
are  nonetheless  limited  by  the  exogenous  cytokine  supplemen¬ 
tation  required  for  ex  vivo  Mk  differentiation. 

To  validate  and  extend  these  findings,  we  developed  a  quanti¬ 
tative  RT-PCR  (qRT-PCR)  assay,  and  applied  this  assay  to  the 
original  ET  cohort  and  an  expanded  cohort  of  normal  controls, 
specifically  collected  to  exclude  potential  gender-bias  in 
HSDl  7B  gene  expression.  These  results  confirmed  and  paral¬ 
leled  those  found  by  microarray,  demonstrating  ~4.5-fold  greater 
HSDl  7B3  transcript  levels  in  normal  platelets  (compared  to  ET, 
p  <0.001)  and  concomitant  ~27-fold  greater  HSD17B12  tran¬ 
script  expression  in  ET  platelets  (compared  to  normal,  p  <0.03); 
these  reciprocal  changes  amount  to  an  aggregate  ~2-log  change 
in  intracellular  HSDl 7B3:HSD17B12  transcript  levels  between 
normal  and  ET  platelets  (Fig.  4d).  Sinee  the  qRT-PCR  data  estab¬ 
lished  an  absolute  deerease  in  HSDl  7B3  transcript  level  in  ET 
platelets,  preliminary  genomic  analyses  of  HSDl  7B3  were  com¬ 
pleted  in  4  of  the  6  ET  patients.  The  1 1  exons  and  intron-exon 
boundaries  were  amplified  and  sequenced,  resulting  in  identifi¬ 
cation  of  a  single  heterozygous  A  insertion  [not  involving  the 
splice  junction  site  (25)]  in  the  first  intron  of  one  patient  (ETl). 
Thus,  there  was  no  evidence  that  a  small  deletion  or  missense 
mutation  affecting  HSD17B3  transcript  stability  was  causally 
implicated  in  reduced  HSDl  7B3  transcript  expression  (data  not 
shown). 

The  distinct  patterns  of  HSD17B  expression  identified  by 
microarray  and  confirmed  by  qRT-PCR  were  then  extended  to  a 


416 


Gnatenko  et  al.:  Platelets  express  steroidogenic  1 7|3-hydroxysteroid  dehydrogenases 


8FRS9 

ATPeV1B2 

GRSFI 

1S-8«p 

RA57 

STFSA 

8M 

EIF3$1 

C8NK1A1 

BZW1 

YUEILt 

KARS 

Re  MSI 

8SR1 

PAPOLA 

EIFS 

CREBL2 

ATP2C1 

UeE2L3 

COMT 

fCORSC 

CTHNAt 

SEC14L1 

AT0X1 

UeB2U 

R$N 

FU107I}2 

RABEXS 

SPRS2 

SPRS2 

MAPK14 

PCNXLS 

6RIM19 

MP2 

TRAPM 

RNPe 

RMEB2 

SUSP 

VtMJI 

MATR3 

ecu 

MAPK1 

EIFS 

TTOAC 

PU214S2 

PROS1 

APG2 

APeA2BP 

HTATIP2 

KTATIP2 

P1IMPI 

F« 

OPS 

AK1 

T8PL1 

SCOCF« 

SERPINE2 

MTHF8 

MYM2 

CACMA1A 

F7 

R6M9 

COL7A1 

PPIG 

RABSA 


ATPSV1B2 

STXBP2 

GRIM19 

TXNOC 

NOUPBS 

DJ«71Niej 

OPYO 

RAB7 

RNT 

ATP2C1 

CLCN3 

UPF3A 

MGCSirS 

SEC14t1 

ATOX1 

AJ(R7A2 

CTN8 

CTN6 

NAPA 

XK 

AXR7A2 

aj2oass 

RABSA 

8LC12A6 

8LC12A5 

CACMA1A 


RJ214S2 

OPS 

C03S 

MAPKt 

8C0GF-8 

rroAE 

PC6R2C 

niAP»s 

MAPK14 

GMNLIA 

PTPRA 

RCPS 

DKGS 

HUM3R84 


COL7A1 

SBEVIW1 

PR081 

SBRPIME2 

Set 


CTNNA1 

OCTNS 

MATR8 

COL7A1 

COL11A2 


Structural 


WBC 


ET 


Normal 


FNTe 

AK1 

M6ST3 

PACe4 

NP 

AKR7A2 

AR02 

PCYT18 

MMPI 

UBC203 

FLJ107(I2 

ATP6V1B2 

MAN2A1 

UBe2L9 

GRIM1S 

PTPRA 

LOCS1064 

UBC3L3 

COM1 

V0U1 

RKEB2 

EIFS 

UBE38 

PCNP 

AKR7A2 

N00T4 

NS017B12 

HP8E 

PAM 

HIP2 

MAPK14 

CDS2 

MAPK1 

M6C317S 

pioe 

KARS 

CSNKIAI 

USP2S 

OPYD 

UBSIC 

C8NK1A1 

YMeiL1 

PAPOLA 

eiPS 

RAB? 

NNT 

ATP2CI 

NOUEBB 

MTHFS 

F7 

KLK1:KLKR 

HSD17B3 

PP16 

RABSA 


CRGBU 

HIP2 

TRAPSS 

OTF3A 

TBPL1 

HTATIPS 

HTATIP2 

RBM» 


BTGI 

N2AV 

CDS9 

TXN 

NOFIP1 

0CTN4 

KIAAM77 

NBT-B 

ARF4 

AfiF4 

pcfyin 

PCMT1 

RNFIO 

LAPTM4B 

C6I>26 

43511  s  at 

LAPTM48 

PC326 

KIAAM04 

CAV2 

LOCS11B6 

FLJ1127S 

AF1300$0 

HTATIP2 

HTATIP2 

FU10342 

SORO 

HSU7ft2S2 

KIAA0974 

cocie 

STXBPS 

FUlOUB 

tOC220S86 

MOC30S7 

PTC013 

MOC19S56 

HOCL 

RBAF6O0 

TOOiN2 

RNF6 

LOCS41B3 

iOC$4103 

HSPCm 

GMITM 

M0M1 

C2(Mm1111 

KIAA12S4 

PSEN2 

AK024eB2 

RBUS3 

M6C2716S 

0KP2pS470IM 

R0BO4 

Ml>C3A 

SEMABA 

M6C3B821 

eXorfd 

RPL37A 

M6C2B816 

PHC9 


Relative  transcript  expression 


417 


Gnatenko  et  al.:  Platelets  express  steroidogenic  1 7|3-hydroxysteroid  dehydrogenases 


larger  cohort  of  20  ET  patients  (6  original  ET  patients  and  14 
newly-studied  individuals),  now  uniformly  analyzed  using  pe¬ 
ripheral  blood  as  the  starting  source  for  platelet  analysis  (bypass¬ 
ing  the  need  for  cumbersome  apheresis  technique  used  in  the 
original  cohorts).  The  qRT-PCR  results  were  entirely  concor¬ 
dant  for  all  individuals  studied,  demonstrating  that 
1 7BHSD12:17BHSD3  transcript  ratios  reliably  predicted  the  ET 
phenotype  in  all  patients  studied  to  date  (p  <0.0001)  (Fig.  5). 
Furthermore,  these  differential  patterns  of  HSDI 7B  expression 
appeared  unrelated  to  the  development  of  thrombocytosis  per  se, 
but  rather,  were  restricted  to  the  ET  phenotype.  This  observation 
is  based  on  the  results  of  four  individuals:  ET28  who  has  second¬ 
ary  thrombocytosis  and  ratios  predictive  of  the  normal  pheno¬ 
type;  and  ET18,  ET20,  and  ET31  with  aggressively-treated  ET 
and  normal  platelet  counts  who  maintain  1 7BHSD12:1 7BHSD3 
transcript  ratios  predictive  of  ET.  The  causes  of  thrombocytosis 
are  varied,  and  larger  cohorts  of  patients  with  etiologically  di¬ 


verse  causes  for  thrombocytosis  will  need  to  be  studied  for  con¬ 
firmation,  and  for  potential  discriminatory  value.  Nonetheless, 
these  initial  data  suggest  that  the  intracellular  signals  regulating 
MK/platelet  1 7BHSD  expression  may  be  associated  with  the  pa¬ 
thophysiological  mechanism(s)  of  the  ET  phenotype. 

To  confirm  that  platelets  retained  functional  17[3-HSD3  ac¬ 
tivity  (and  to  compare  this  activity  between  normal  and  ET  pla¬ 
telets),  we  quantified  the  oxidative  conversion  of  testosterone  to 
4-androstenedione.  Entirely  consistent  with  the  gene  expression 
data,  normal  platelets  retained  1 7P-HSD3  activity,  providing  (to 
our  knowledge)  the  first  evidence  that  non-testicular  sources  re¬ 
tain  functional  capacity  in  the  penultimate  step  of  androgen  bio¬ 
synthesis;  furthermore,  the  platelet-derived  17[3-HSD3  activity 
was  not  inconsequential,  providing  nearly  10%  of  the  capacity 
found  in  testis  (Fig.  6).  Finally,  in  the  initial  cohort  of  patients 
studied,  ET  platelets  demonstrated  total  17[3-FISD3  activity  that 
was  not  statistically  different  from  that  found  in  normal  platelets. 
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Figure  3:  Identification  of  most  significant  differentially-ex- 
pressed  ET  genes.  Gene  selection  was  calculated  for  each  gene  by  ap¬ 
plying  a  5-fold  cut-off  and  a  computed  t-statistic,  using  the  aggregate 
(N=2906)  gene  list  (a),  or  the  list  of  platelet-restricted  (N=  1 26)  genes 
(b).  c.  The  list  of  40  genes  (20  over-expressed,  20  under-expressed)  de¬ 
monstrating  greatest  differential  expression  by  one-way  ANOVA  is  pres¬ 
ented,  ranked  by  the  ratio  of  the  mean  group  normalized  signals 


(ETNormal);  shaded  column  delineates  genes  with  highest  expression  in 
ET  whereas  unshaded  column  refers  to  genes  with  lowest  expression. 
Genes  are  delineated  by  gene  symbol,  or  if  unassigned,  are  specified  by 
GenBank  ID  in  parentheses.  Genes  in  bold  are  platelet-restricted;  2/ 1 2 
genes  in  Panel  b  {ATOXI ,  LAPTM4B)  are  not  depicted  because  they  are 
present  in  Panel  c. 
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Figure  4: Transcript  analysis  of  platelet-expressed  HSDIJBs. 

Normalized  microarray  values  for  individual  patients  (Panels  a,  b)  or  the 
normalized  aggregate  means  by  group  (c)  are  shown.  In  Panels  a 
(HSD/763)  and  b  (HSD/76/2),  expression  levels  are  log2-transformed, 
such  that  negative  numbers  reflect  decreased  expression  compared  to 
the  normalized  mean  of  1 4  chips  (6  ET,  5  normals,  3  WBC);  for  individ¬ 
ual  patients,  delineation  of  transcript  expression  using  Affymetrix  MAS 
5.0  software  is  specified  (P  -  present;  M  -  marginal;  A-  absent).  The  mega¬ 
karyocyte  (Mk)  data  in  (a)  and  (b)  were  downloaded  from  the  GEO  plat¬ 


form  GPL96  database,  and  analyzed  using  GeneSpring  software,  d. 
Quantitative  RT-PCR  was  completed  on  the  original  cohort  of  6  ET  pa¬ 
tients  and  a  new  cohort  of  lO  normal  controls  (5  males,  5  females)  using 
HSDI7B3-,  HSDI7BI  /-,  HSDI7BI2-,  or  F7-specific  oligonucleotide 
primers  (platelet  coagulation  factor VII  (F7)  is  primarily  endocytosed 
from  plasma  with  negligible  to  no  platelet  mRNA  expression  (34), 
thereby  establishing  the  lower  limit  of  assay  sensitivity  [  I .  I  X  1 0“^  +  2.6 
X  1 0"^  ng/ng  actin,  not  shown]).  *p  <0.03;  **p  <0.00 1 . 


Thus,  the  high  level  expression  of  HSD17B12  transcript  in  ET 
platelets  is  unassociated  with  overall  enzymatic  capabilities  in 
androgen  biosynthesis.  To  date,  the  potential  role  of  1 7[3-HSD  1 2 
in  human  steroidogenesis  remains  unknown,  although  our  data 
would  suggest  that  its  substrate  specificity  is  distinct  from  that  of 
17[3-HSD3.  Phylogenetic  analysis  of  the  recently  identified 
HSD17B3  and  HSD17B12  orthologs  from  the  zebrafish  Danio 
rerio  (26)  suggest  an  evolutionarily  conserved  and  potentially 
important  physiological  role  fox  HSDl  7B12,  although  its  precise 
function  remains  speculative  and  offers  little  insight  to  the  func¬ 
tion  of  its  human  ortholog. 


Discussion 

Our  data  provide  the  first  evidence  that  distinct  subtypes  of  the 
steroidogenic  17[3-HSDs  are  functionally  expressed  in  human 
blood  platelets,  and  that  the  reciprocal  expression  patterns  of 
HSD17B3  and  HSD17B12  are  distinctly  associated  with  a  rare 
platelet  disorder  manifest  by  quantitative  and  qualitative  platelet 
defects.  In  all  ET  patients  studied  to  date,  a  simple  and  reliable 
assay  predicts  the  ET  phenotype,  providing  a  potentially  diag¬ 
nostic  molecular  marker  for  this  disease;  to  date,  we  have  insuf¬ 
ficient  evidence  that  this  assay  can  distinguish  ET  from  reactive 
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Figure  5:  Genetic  assays  of  normal  and  ET  platelets,  a.  qRT-PCK 
was  completed  on  platelets  Isolated  by  routine  phlebotomy  (10  ml)  from 
ET  or  normal  controls,  using  HSD  /  76-specific  primers  as  outlined  in  Fig. 
4d.The  (log2)  ratio  of  HSDI7BI2:HSDI7B3  transcript  expression  is  de¬ 
picted  for  individual  patients,  categorized  by  platelet  count  ( I  O’/L)  at 
time  of  platelet  isolation;  Category  I:  0  -  350;  Category  II:  351-675; 


thrombosis.  Given  the  broad  etiologies  for  thrombocytosis,  a 
more  comprehensive  study  is  currently  being  designed  for  large 
cohort  analysis  and  applicability. 

What  is  the  biological  relevance  of  17[3-HSD  expression  in 
platelets,  and  is  this  causally  associated  with  the  molecular  de¬ 
fect  regulating  megakaryocytopoiesis  and/or  proplatelet 
formation?  Platelet  production  is  complex,  but  is  clearly  affected 
by  endogenous  or  exogenous  sex  hormones.  Androgens  improve 
platelet  counts  in  patients  with  distinct  types  of  thrombocytope¬ 


Category  III:  676-999;  Category  IV:  >1,000  (refer  to  Table  I  for  detailed 
patient  characteristics).  Note  that  three  Category  I  ET  patient  had  nor¬ 
malized  platelet  counts  with  treatment  at  the  time  of  analysis,  and  re¬ 
tained  an  HSDI7BI2:HSDI7B3  ratio  predictive  of  the  ET  phenotype;  the 
single  patient  with  secondary  thrombocytosis  (RT)  studied  to  date  had  a 
normal  ratio. 


nia  (27),  and  diminished  thrombopoiesis  is  evident  in  castrated 
mice,  with  correction  upon  testosterone  supplementation  (28). 
Furthermore,  murine  Mk  retain  the  machinery  for  estradiol  syn¬ 
thesis  in  vitro,  and  the  secreted  hormone’s  ability  to  bind  es¬ 
trogen  receptor  (3  triggers  proplatelet  formation  in  an  autocrine 
manner  (23).  Although  we  did  not  characterize  HSD17B  ex¬ 
pression  in  Mk,  diminished  platelet  HSDl  7B3  transcripts  (with 
exaggerated  HSD17B12  expression)  presumably  reflects  com¬ 
parable  changes  in  Mk.  While  HSD17B3  transcript  is  down- 


Figure  6:  Functional  I7P-HSD  assays  in  platelets.  Lysates  of  pla¬ 
telets  Isolated  from  3  normal  or  3  ET  patients  were  used  for  deter¬ 
mination  of  functional  1 7(3-HSD3  activity  at  distinct  time  points,  and  ex¬ 
pressed  as  standardized  mean  +  SEM  (control  is  HSD  buffer  alone).  In 
Panel  b,  curves  generated  for  30-minute  (•)  or  120-minute  (A) 


I7-(3HSD3  assays  (inset)  were  used  as  standards  for  comparative  deter¬ 
mination  of  platelet  I7|3-HSD3  assays  performed  In  parallel.  Data  are  the 
mean  +  SEM  from  6  determinations;  note  that  0.1  testis  equivalent  units 
correspond  to  10%  of  the  activity  found  in  protein  equivalent  of  mouse 
testes. 
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regulated  in  ET  platelets,  the  changes  are  not  as  pronounced  as 
those  of  HSD17B12  where  transcript  expression  is  ^25-fold 
higher  than  in  normal  platelets.  Despite  this  exaggerated  in¬ 
crease,  our  data  demonstrate  that  HSDl  7B12  induction  is  unas¬ 
sociated  with  enzymatic  changes  in  total  platelet  17[3-HSD3  ac¬ 
tivity.  Thus,  the  function  and  substrate  specificity  of  human 
17P-HSD12  appears  distinct  from  that  of  17[3-HSD3,  and  eluci¬ 
dation  of  its  substrate(s)  may  provide  further  insights  into  the 
roles  of  17(3-HSDs  in  platelet  and/or  Mk  functions. 

Human  platelets  express  both  androgen  and  estrogen  [3  recep¬ 
tors,  and  platelet  function  is  known  to  be  modulated  by  gender 
differences  (29),  the  menstrual  cycle  (30),  and  exogenous  testos¬ 
terone  (31).  Similarly,  hormonal  replacement  in  postmenopausal 
females,  or  oral  contraceptive  use  in  menstruating  females  are 
known  to  predispose  to  thrombotic  diseases  (32,  33),  although 
the  mechanism(s)  of  this  effect  remain  unclear.  These  effects 
could  be  mediated  at  either  genomic  (Mk  transcriptional  activity 
regulated  by  hormonal/receptor  binding)  or  nongenomic  (non- 
transcriptional)  levels,  although  an  interplay  at  both  levels  is 


equally  plausible.  The  evidence  that  human  blood  platelets  retain 
17P-HSD3  activity,  express  distinct  subtypes  of  17P-HSD 
enzymes,  and  demonstrate  altered  HSD17B  expression  patterns 
in  a  disorder  known  to  be  association  with  thrombohemorrhagic 
risk,  provides  novel  insights  into  the  interplay  between  sex  hor¬ 
mones,  platelet  function,  and  vascular  diseases,  both  cerebro- 
and  cardiovascular. 
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Summary.  Proteomic  technology  has  the  potential  to  trans¬ 
form  the  way  we  analyze  platelet  biology,  through  the 
determination  of  platelet  protein  composition  and  its  modifi¬ 
cation  upon  stimulation  and  with  disease.  We  are  a  considerable 
way  from  achieving  these  goals,  however,  because  of  significant 
limitations  in  current  methodology.  It  is  therefore  important  to 
consider  the  extent  to  which  these  aims  can  be  met  and  the  way 
that  proteomic  data  should  be  presented  and  used.  These  issues 
are  discussed  in  the  present  paper  by  the  Platelet  Physiology 
Subcommittee  of  the  ISTH  Scientific  Standardisation  Commit¬ 
tee  (SSC).  It  is  recommended  that  proteomic  information  be 
combined  with  data  from  other  experimental  approaches  to 
establish  a  database  on  protein  expression  and  function  in 
platelets. 


Introduction 

The  proteomic  revolution  has  been  driven  by  the  ability  of  mass 
spectrometry  techniques  to  identify  and  map  partial  protein 
amino  acid  sequences  and  thereby  enable  predictions  of  protein 
identification.  Potentially,  this  approach  could  lead  to  mapping 
of  the  full  protein  composition  of  a  cell,  including  levels  of 
protein  expression  and  sites  of  post-translation  modification, 
e.g.  phosphorylation.  Importantly,  proteomics  can  further  be 
used  to  investigate  how  these  events  are  altered  upon  stimu¬ 
lation  and  with  disease.  This  is  vital  information  on  which  to 
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build  our  understanding  of  platelet  biology,  with  the  ultimate 
goal  of  defining  the  function  of  every  platelet  protein. 

There  are  however  considerable  hurdles  to  be  overcome  in 
realizing  these  aims  and  a  significant  potential  for  errors  to 
accrue  with  regard  to  protein  identification  and  modification. 
The  present  article  discusses  the  reasons  underlying  these  issues, 
and  considers  the  way  in  which  proteomic  information  should 
be  presented  and  used.  The  need  for  caution  and  independent 
evidence  supporting  expression  and  function  is  emphasized. 

To  what  extent  can  we  use  proteomics  to  map 
the  platelet  proteome? 

Proteomic  technology  involves  either  fractionation  of  proteins, 
digestion  into  peptides  and  sequencing  by  mass  spectrometry 
or  immediate  protein  digestion,  fractionation  of  peptides  and 
then  sequencing  [Multidimensional  Protein  Identification 
Technology  (MuDPit)].  Purification  is  an  essential  step  in 
mapping  the  full  proteome  because  of  the  wide  range  of  protein 
levels  within  a  cell,  which  vary  by  up  to  8  orders  of  magnitude. 
As  a  consequence,  competition  for  sequencing  between 
peptides  will  favor  those  derived  from  highly  expressed  proteins 
over  other,  often  very  interesting  but  lower  expressed  proteins, 
such  as  kinases  and  receptors.  Significant  sequence  coverage  is 
also  only  achieved  for  proteins  that  are  present  in  high 
amounts,  and  this  coverage  is  seldom  more  than  50%. 
Depending  on  the  criteria  and  methodology,  the  largest  group 
of  proteins  is  often  those  identified  on  the  basis  of  a  single 
peptide  [1].  For  these  and  other  proteins  with  low  coverage, 
there  is  a  need  for  independent  confirmation  of  expression. 
Thus,  at  present,  we  are  a  long  way  from  achieving  the  goal  of 
mapping  all  proteins  within  a  cell. 

Protein  purification  plays  an  important  role  in  facilitating 
protein  identification.  The  more  concentrated  the  protein  in  the 
mixture,  the  more  likely  that  sequence  information  will  be 
obtained.  The  partial  or  full  purification  of  a  protein  sample, 
for  example  through  isolation  of  a  cellular  organelle  or  by 
affinity  chromatography,  can  have  a  dramatic  effect  on  protein 
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identification.  The  most  frequently  used  high  throughput 
platform  for  protein  identification  is  two-dimensional  (2D) 
gel  electrophoresis,  based  on  sequential  isoelectric  focussing 
(lEF)  and  sodium  dodecyl  sulfate-polyacrylamide  gel  electro¬ 
phoresis  (SDS-PAGE)  steps.  This  technique  not  only  favors 
identification  of  highly  expressed  proteins,  but  is  also  capable 
of  detecting  proteins  over  several  orders  of  magnitude  using 
high-resolution  2D  gels  and  very  sensitive  protein  dyes.  There 
are  however  special  problems  associated  with  2D  gel  electro¬ 
phoresis  in  identifying  hydrophobic,  basic  and  high-molecular- 
weight  proteins.  Both  ID  gel  and  non-gel-based  methods  can 
be  used  to  identify  these  groups  of  proteins,  but  with  the 
caveats  of  increasing  the  complexity  of  the  peptide  mixture  and 
favoring  identification  of  the  most  highly  expressed  proteins.  In 
addition,  this  approach  loses  valuable  information  on  p/  values 
which  facilitates  protein  identification  and  provides  informa¬ 
tion  on  possible  post-translational  modifications.  Alternative 
methodologies  that  identify  subgroups  of  peptides,  such  as 
those  that  contain  cysteine  residues,  have  the  advantage  of 
providing  a  less-complex  peptide  mixture  for  analysis,  although 
information  is  limited  by  the  number  of  peptides  recovered. 
Given  the  above  considerations,  several  complementary  pro- 
teomic  approaches  will  usually  be  required  to  map  the 
composition  of  a  complex  sample. 

There  is  a  considerable  potential  for  error  and  misinforma¬ 
tion  in  proteomics  that  can  severely  hamper  research.  This 
includes  errors  relating  to  sample  preparation  and  contamin¬ 
ation,  peptide  sequencing  and  database  analysis  as  illustrated 
below.  It  is  therefore  important  to  consider  the  methodological 
aspects  relating  to  platelet  preparation,  and  the  way  that  we 
should  present  and  interpret  proteomic  information.  The  latter 
is  of  particular  relevance  in  view  of  the  recent  plethora  of 
proteomic  publications  on  platelets  that  has  been  driven  by  the 
relative  ease  of  access  to  material  and  the  important  role  of 
platelets  in  atherothrombotic  and  inflammatory  disease.  It  is 
however  beyond  the  scope  of  this  article  to  provide  full 
experimental  details  on  the  way  to  prepare  and  analyze 
platelets  for  analysis,  and  there  is  also  unlikely  to  be  widespread 
agreement  on  this  in  view  of  the  differing  ways  that  platelets  are 
prepared  in  experimental  laboratories  throughout  the  world.  A 
number  of  the  factors  that  must  be  taken  into  consideration  in 
the  proteomic  analyses  are  highlighted  below. 

1  Platelet  preparation.  It  is  essential  to  minimize  the  degree  of 
platelet  activation  during  blood  collection  and  cell  isolation 
to  limit  activation-dependent  changes  in  post-translation 
modification,  e.g.  protein  phosphorylation.  Special  consid¬ 
eration  should  be  given  to  the  use  of  prostacyclin  and  other 
cAMP-elevating  agents  during  the  isolation  procedure  as 
their  use  will  generate  new  protein  forms  through  phos¬ 
phorylation. 

2  Protein  extraction.  Experimental  reactions  should  be  stopped 
as  quickly  as  possible  to  minimize  post-activation  changes  in 
proteins,  notably  degradation.  Reactions  should  be  termin¬ 
ated  rapidly,  e.g.  by  rapid  freezing,  and  a  cocktail  of  protease 
inhibitors  should  be  used,  although  this  complicates  the  use 
of  MuDPit  through  the  introduction  of  new  peptide  species. 


3  Platelet  purity.  The  degree  of  contamination  of  the  platelet 
sample  with  other  cell  types  and  plasma  should  be  kept  to  a 
minimum.  This  is  of  particular  significance  in  mapping 
proteins  that  are  expressed  at  low  levels,  as  contamination 
could  lead  to  identification  of  more  highly  expressed  proteins 
in  other  blood  cells  or  plasma.  Sample  purity  applies  also  to 
use  of  polymerase  chain  reaction  (PCR)-based  techniques  in 
analysis  of  gene  identification  in  platelets.  It  may  be  prudent 
to  use  the  same  method  of  platelet  preparation  in  proteomic 
and  real  time  (RT)-PCR-based  analyses,  especially  as  the 
latter  can  be  used  to  look  for  tissue-specific  genes  (and 
therefore  give  an  indication  of  purity). 

4  Endogenous  and  exogenous  proteins.  Platelets  have  the  ability 
to  take  up  proteins  from  the  surrounding  medium  via  surface 
receptors  and  through  endocytosis,  and  also  bind  plasma 
proteins  on  their  surface.  The  protein  composition  of  the 
platelet  and  profile  of  expressed  genes  are  therefore  distinct. 
Factors  that  influence  uptake  of  proteins  and  protein  binding 
(e.g.  buffer)  will  alter  the  composition  of  the  platelet 
proteome  and  must  be  appreciated  by  the  investigator. 

5  Peptide  sequencing.  Errors  can  accrue  in  peptide  sequencing 
as  a  result  of  ambiguous  patterns  of  fragmentation,  neces¬ 
sitating  use  of  stringent  criteria  on  which  to  base  acceptance 
of  expression.  However,  because  of  variations  between  the 
resolution  power  of  mass  spectrometers  and  differences  in 
methodology,  it  is  difficult  to  arrive  at  absolute  guidelines  on 
this  subject.  It  should  therefore  be  recognized  that  there  is  a 
potential  for  error  in  peptide  sequencing,  albeit  one  that  is 
low  ( <  5%)  [1],  and  that  this  is  governed  by  the  criteria  used 
for  acceptance  of  the  predicted  sequence.  The  potential  for 
error  can  be  lessened  by  manual  inspection  of  records  and 
through  use  of  other  supporting  information,  if  available, 
such  as  molecular  weight  and  p/  values.  There  is  also  the 
issue  of  splice  variants,  which  cannot  be  predicted  from 
genomic  databases. 

6  Protein  databases.  There  is  a  potential  for  error  in  searching 
protein  databases  such  as  NCBI,  SWISS-PROT  and 
TrEMBL.  For  example,  not  all  proteins  are  present  and 
the  databases  themselves  almost  certainly  contain  errors, 
although  considerable  efforts  are  being  made  to  correct  these 
and  other  groups  such  as  HuPO  and  IPI  are  attempting  to 
produce  better  databases.  Nevertheless,  the  predictive  value 
of  sequenced  peptides  does  not  take  into  account  factors 
such  as  alternative  splicing,  polymorphisms  and  post-trans- 
lational  modifications. 

How  should  we  use  proteomic  information? 

It  is  important  to  consider  the  extent  to  which  we  can  rely  on 
proteomic  information  to  establish  expression  of  a  protein  in 
a  cell.  For  highly  expressed  proteins,  this  is  not  a  major  issue, 
as  such  proteins  are  routinely  picked  up  in  a  variety  of 
proteomic-based  methods  and  through  repeat  studies,  with  a 
large  degree  of  supporting  peptide  sequence  information 
being  obtained.  It  is  generally  accepted  that  a  sequence  of 
eight  amino  acids  or  more  is  sufficient  to  serve  as  a  unique 
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identifier  of  a  protein,  although  not  all  peptides  of  this  length 
will  map  to  a  single  protein.  Bearing  in  mind  the  potential 
for  a  sequence  error,  however,  an  increased  degree  of 
confidence  in  protein  identification  is  gained  through  iden¬ 
tification  of  two  or  more  unique  peptides  of  this  length.  The 
probability  of  obtaining  two  different  sequences  for  the  same 
protein  from  a  sample  via  sequencing  errors  is  extremely  low, 
although  this  is  dependent  on  the  stringency  of  the  accept¬ 
ance  criteria  for  a  peptide. 

Special  consideration  needs  to  be  given  to  proteins  that 
have  been  identified  on  the  basis  of  a  single  peptide,  the 
so-called  ‘one-hit  wonders’  [1].  This  group  of  proteins  often 
makes  up  the  largest  group  of  identified  proteins  in  a  study. 
While  it  is  important  to  report  this  information  available,  it  is 
even  more  critical  that  such  data  be  published  with  the  clear 
statement  that  supporting  information  is  required  to  confirm 
expression.  The  field  will  move  backward,  not  forward,  if  it 
has  to  spend  an  inordinate  amount  of  time  correcting 
mistakes. 

Current  status  of  platelet  proteomic  studies 

Several  reports  on  the  mapping  of  the  platelet  proteome  have 
now  been  published,  most  notably  over  the  last  twelve  months. 
These  reports  have  used  a  variety  of  approaches,  including 
classical  2D  gel  electrophoresis  [2,3],  combined  fractional 
chromatography  (COFRADIC),  gel  chromatography  based 
on  labelling  of  specific  amino  acids  or  regions  in  the  protein, 
e.g.  N-terminal  labelling  [4,5],  or  through  use  of  MuDPit, 
which  avoids  the  need  for  gel  chromatography  [6,7].  The  largest 
number  of  proteins  identified  in  a  single  approach  is  just  over 
400,  via  2D  gel  chromatography  using  narrow-range  IFF  gels 
[2,3].  The  other  approaches  have  identified  80-250  proteins. 
Flowever,  while  the  quantity  is  impressive,  the  relative  quality 
of  the  data  from  these  different  approaches  will  only  be 
understood  over  time. 

There  is  a  considerable  degree  of  overlap  in  the  types  of 
proteins  that  have  been  found  in  these  studies.  This  includes 
groups  of  proteins  that  are  expressed  at  high  level,  such  as  those 
involved  in  the  regulation  of  the  cytoskeleton  and  in  protein 
synthesis.  Proteins  are  also  carried  over  from  the  megakaryo¬ 
cyte  and  this  may  be  of  particular  relevance  for  the  latter  group, 
since  platelets  have  a  minimal  capacity  for  supporting  protein 
synthesis.  These  studies  have  also  identified  many  unique 
proteins,  some  of  which  have  only  previously  been  recognized 
as  hypothetical  proteins  in  the  database,  and  so  provided  first 
reported  expression  in  any  cell.  Flowever,  many  of  these 
identifications  have  been  based  on  a  single  peptide  sequence 
necessitating  extreme  caution  as  discussed  above. 

An  impressive  feature  of  these  studies  is  the  number  of 
proteins  that  have  not  been  previously  reported  in  platelets, 
which  is  estimated  to  be  in  the  order  of  50%  of  those 
identified.  On  the  contrary,  it  is  of  concern  that  several  major 
regulatory  groups  of  proteins  are  absent,  such  as  G  protein- 
coupled  receptors  and  key  signaling  molecules,  e.g.  phos¬ 
pholipases  and  tyrosine  kinases.  This  is  likely  to  reflect  their 


low  level  of  expression  as  well  as  other  factors  that  interfere 
with  some  methodologies,  e.g.  the  hydrophobicity  of 
membrane  proteins  is  a  recognized  problem  in  2D  gel 
electrophoresis. 

Recommendation  for  mapping  of  the  platelet  proteome 

It  is  not  possible,  at  present,  to  establish  a  specific  set  of  criteria 
on  which  to  base  acceptance  of  protein  expression  through 
proteomics  because  of  the  widespread  variation  in  factors  such 
as  the  sensitivity  of  mass  spectrometers,  mode  of  sample 
preparation  and  so  forth.  This  has  been  discussed  in  a  recent 
editorial  in  Molecular  and  Cellular  Proteomics,  which  also 
provides  important  guidelines  on  the  way  that  proteomic 
information  should  be  presented  [8].  We  are  in  full  support  of 
the  adoption  of  these  guidelines  in  the  reporting  of  platelet 
proteomic  information.  Flowever,  until  strict  criteria  are 
established  on  which  to  base  acceptance  of  expression,  the 
Platelet  Physiology  Subcommittee  strongly  recommends  that 
information  derived  from  other  techniques  be  used  to  support 
proteomic  data.  The  information  on  protein  expression  should 
be  compiled  into  a  database  as  follows: 

1  Proteins  that  are  known  to  have  a  functional  role  in  platelet 
regulation.  There  are  many  proteins  that  are  known  to  play 
an  important  role  in  platelet  biology,  whether  or  not  they 
have  been  identified  through  proteomics.  Such  information 
has  been  obtained  through  studies  on  mutant  mice,  use  of 
pharmacological-specific  agents  etc.  Examples  of  proteins  in 
this  class  include  G  protein-coupled  receptors  and  signalling 
proteins. 

2  Proteins  of  unknown  function,  but  for  which  expression  is 
beyond  doubt.  Expression  of  many  proteins  in  platelets  has 
been  established  through  protein  chemistry  and  related 
techniques,  such  as  flow  cytometry  and  radioligand  binding, 
even  though  their  function  remains  unclear.  This  group 
almost  certainly  includes  proteins  that  have  been  ‘carried 
over’  from  the  megakaryocytes,  which  may  or  may  not  have 
a  functional  role  in  platelets,  e.g.  transcription  factors. 

3  Proteins  identified  only  by  proteomic  andjor  PCR-based 
techniques.  The  largest  group  of  proteins  have  been  identified 
solely  on  the  basis  of  proteomic  and  genomic  technologies. 
In  such  cases,  the  need  for  independent  supporting  infor¬ 
mation  on  expression  such  as  Western  blotting  of  demon¬ 
stration  of  function  is  necessary. 

The  database  will  serve  to  guide  development  of  platelet- 
specific  antibody  and  gene  microarrays  that  could  be  used  to 
study  individuals  with  platelet-  and  megakaryocyte-related 
disorders.  These  platelet-specific  arrays  have  several  advantages 
over  the  use  of  proteomics  as  a  high  throughput  platform  for 
patient-based  studies,  including  a  focus  on  proteins  that  are 
known  to  play  important  roles  in  regulating  platelet  reactivity. 

Future  developments 

It  is  interesting  to  speculate  on  the  number  of  proteins  that  are 
expressed  in  platelets,  and  the  proportion  of  these  that  are 
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likely  to  be  of  functional  importance.  The  number  of  proteins 
corresponding  to  separate  gene  products  that  have  been 
identified  in  platelets  using  proteomics  is  approaching  1,000, 
which  is  a  considerable  achievement.  However,  this  is  a 
fraction  of  the  number  of  platelet-based  genes  predicted  using 
microarrays,  which  is  in  the  order  of  4—6000  [9-11],  although 
not  all  of  these  mRNAs  may  have  been  translated.  Proteomics 
has  also  failed  to  identify  many  important  platelet  proteins, 
including  surface  receptors,  protein  kinases  and  signalling 
proteins,  and  so  this  technology  presently  falls  far  short  of 
serving  as  a  high-throughput  approach  for  identifying  platelet 
proteins. 

The  analysis  of  post-translational  modifications  of  proteins 
in  basal  and  activated  platelets  brings  in  a  further  level  of 
complexity  in  the  analysis,  but  has  the  important  potential  to 
provide  novel  information  on  factors  such  as  degradation  and 
protein  phosphorylation  [12].  Analysis  of  non-activated  platelet 
by  2D  gel  electrophoresis  has  indicated  that  expressed  genes 
are,  on  average,  represented  by  2.5  features  (i.e.  distinct  spots 
that  are  detected  with  the  dye),  although  this  ratio  is  likely  to  be 
much  higher  in  a  stimulated  cell  (and  will  also  be  dependent  on 
the  nature  of  the  stimulus).  Even  under  basal  conditions,  a 
small  number  of  proteins  could  be  resolved  into  as  many  as  ten 
separate  features  or  more  suggesting  extensive  protein  modi¬ 
fications  [3].  It  will  take  a  considerable  time  and  effort  to  map 
the  nature  of  these  changes,  and  we  are  far  from  being  able  to 
achieve  this  at  present. 

A  further  issue  is  that  of  quantitative  proteomics.  This  is 
likely  to  be  the  major  way  of  applying  the  technology  to 
patient  samples,  for  example  in  analyzing  specific  diseases  and 
providing  a  peptide  map  that  could  be  used  in  phenotyping 
patients.  There  are  also  a  number  of  technologies,  including 
isotope-coded  affinity  tag  (ICAT),  mass-coded  abundance 
tagging  (MCAT)  and  '*0-labeling,  that  could  be  used  for 
investigating  the  effect  of  drug  treatment  by  comparing 
samples.  However,  the  ultimate  goal  is  to  compare  samples 
from  individual  patients  which  would  require  absolute 
quantitation.  We  are  a  considerable  way  from  being  able  to 
achieve  this  at  present,  and  it  is  highly  likely  that  other 
techniques  such  as  antibody  arrays  will  also  have  to  be  used 
to  achieve  this. 

An  urgent  immediate  goal  is  to  establish  a  database  that  will 
describe  proteins  expressed  in  platelets  and  the  way  they  change 
upon  activation  and  in  disease.  The  Platelet  Physiology 
Subcommittee  recommends  the  setting  up  of  a  database 
overseen  by  a  subgroup  to  evaluate  evidence  for  protein 
expression  and  modification,  with  all  related  information 
maintained  in  the  public  domain.  It  will  be  important  for  the 
database  to  be  maintained  in  collaboration  with  the  widely 
available  and  established  databases  such  as  SwissProt,  HuPO 
and  the  International  Protein  Index  of  EMBL-EBI.  The  need 
for  caution  is  stressed,  but  the  database  will  serve  as  a  powerful 
resource  to  further  our  understanding  of  platelet  biology. 
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